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© Intra-node pacing for high speed networks. 

© A method for reducing transmission polling over- 
head of packets within a node of a communications 
network. This invention involves setting session 
ready bits in the line adaptor modules of the node. 
These bits are then used to inform a given module 
as to which modules of the node have packets that 



are bound for the given module. A given module of 
the node will not transmit a pacing message (YT 
control message) to another module unless it knows 
that the other module has a packet to transmit to the 
given module. 
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INTRA-NODE PACING FOR HIGH SPEED NETWORKS 



Tec h nical Field 

This invention generally relates to a method of 
transmitting packets between line adaptor modules 
of a network node. More specifically, each module 
stores information (ready bits) as to which other 
modules of the node have the packets to transmit 
to it A given module of the node will not receive a 
signal (YT control message) from another module 
of the node unless the other module knows that the 
given module has a packet to transmit to the other 
module. 



Background of the Invention 

One of the sources of delay in the transmission 
of a packet through a multi-node communications 
network is the need to switch the packet at various 
nodes of the network. Such delays increase the 
need for additional bandwidth and buffer space. 

Many intra-node pacing techniques have been 
developed to reduce intra-node switching delays. 
One such method is described in the following 
reference: Andrew S. Tannenbaum, COMPUTER 
NETWORKS, 2nd ed.. Prentice Hall, 1988, Sec. 
2,7.1. 

All of the pacing techniques known in the art, 
however, are inefficient because of the overhead 
involved in transmitting from an input line adaptor 
module to a output line adaptor module of a switch- 
ing node. This inefficiency occurs because all of 
the pacing schemes in the art will send a polling 
signal to an incoming line adaptor module from an 
outgoing line adaptor module without knowing 
whether or not the incoming line adaptor module 
has packets to transmit to the outgoing line adaptor 
module. Therefore, a need exists for a more effi- 
cient intra-node pacing scheme to reduce delays 
and overhead in the transmission of packets 
through multi-node networks. 



Summary of the Invention 

It is, therefore, an object of this invention to 
provide a more efficient intra-node pacing tech- 
nique. 

More specifically, it is an object of this inven- 
tion to reduce polling overhead caused by switch- 
ing a packet from incoming module of a node to an 
outgoing module of the node. 

Accordingly, the present invention provides a 
method of transmitting packets between fine adapt- 
or modules of a network node. With this invention, 



a session ready (SR) control message is sent from 
one module of the node to another module of the 
node if the one module has at least one packet to 
send to the other module. When the other module 

s receives the SR control message, it sets a ready 
bit to a ready equals yes state, which indicates that 
the one module has least one packet in its buffers 
to transmit to the other module. If the ready bit in 
the other module is set to the ready equals yes 

10 state, then the other module transmits a your turn 
(YT) control message to the one module. Finally, 
the packet will be transmitted from the one module 
to the other module if the YT control message is 
received by the one module. 

75 

Brief Description of the Drawings 

Ffg. 1 is a schematic illustration of a node of a 
20 network showing the line adaptor modules, the 
switching fabric and the control modules. 
Fig. 2 is a schematic representation of the line 
adaptor module (LAM). 

Fig. 3 is schematic illustration of the incoming 
25 and outgoing session tables of the line adaptor 
modules. 

Fig. 4 is a schematic representation of the SR 
and YT control messages. Also shown are data 
packets with and without an SR control rnes- 
30 sage. 

Description of an Embodiment 



35 Overview of the High Speed Network Node Hard- 
ware " "* 

Referring to Fig. 1, each network node consists 
of several line adapter modules (LAMs) 5, control 

40 modules 7 and a, and a switching "fabric" 6, that 
provides any-to-any connectivity among the LAMs. 
The switching fabric can be a bus, crossbar switch, 
communications ring, optical switch or any other 
switching means. Also, shown are the control and 

45 data lines 25 between the LAMs and the switching 
fabric. An implementation of a Network Node must 
also include appropriate power supply, harness 
and/or connectors for attaching communications 
transmission lines to the LAMs, and a housing 

so cabinet and/or rack to hold all of the above. Fig. 1 
also shows the organization of the main compo- 
nents of a Network Node. Each LAM 5 connects to 
another network node via a transmission lines 15 
operating with an appropriate data link protocol, 
such as the standard HDLC protocol. One of the 



2 



EP 0 419 840 A2 



control modules 8 handles network control and 
management functions such as IBM's SNA-APPN. 
The other control module 7 connects to nodes 
operating with "foreign" protocols such as the stan- 
dard X25. In the following text, we elaborate on 
each of these components and functions. 

Referring to Fig. 2. each line adapter module 
(U\M) 5 has a communication link interface 23 
providing for communication with other Network 
Nodes or compatible End Nodes, and a switching 
interface 27 providing for communication with any 
of the other LAMs or control modules through the 
switching fabric within the same Network Node. 
The switch interface 27 provides the logic to inter- 
face the receive buffers 22 and send buffers 24 
with the switching fabric, while the communications 
link interface 23 provides the logic to interface the 
receive and send buffers with transmission line 15. 

A transmission line 15, which could typically 
carry a DS3 signal, may span a large physical 
distance, so that the propagation delay time for a 
packet to pass from one transmission line interface 
through a transmission line and arrive at the 
"other" end's transmission line interface may be 
several or many "packet transmission times." By 
packet transmission time we mean (packet size in 
bits)/(data rate of transmission interface in bits per 
second). 

On the other hand, all the components of a 
Network Node are in close physical proximity to 
one another, so that propagation delay time be- 
tween LAMs within the Network N connected by 
the switching fabric is less than one "packet trans- 
mission time." This property and the basic require- 
ment that a Network Node provide efficient routing 
and fair scheduling of data packets gives rise to 
the need for an intra-node pacing scheme to be 
practiced by LAMs communicating via the switch- 
ing fabric of a Network Node. 

Again referring to Fig. 2, besides the switch 
and transmission line interface, each LAM contains 
numerous packet buffers (see 22 and 24), control 
tables 26 and logic control 28. which, for example, 
could be a microprocessor such as INTEL 
80960CA. There are (at least) enough receive 
packet buffers 22 to accommodate the number of 
packets that can be transmitted on the LAM's 
transmission line during the "round" trip propaga- 
tion delay time to the LAM on the "other end". 
There are (at least) enough send packet buffers 24 
to accommodate the number of packets that can 
be transmitted during the propagation delay time of 
the switching fabric. The send and receive buffers 
could be standard S-RAMs. 

The control tables 28 and logic control 28 
completely provide for the execution of the network 
routing procedures, the link flow control protocols, 
and the intra-node pacing protocols. Once virtual 



circuits and buffer classes are "set up," packet 
flow is controlled completely by the LAMs, trans- 
mission lines, and the switching fabric hardware. 
Fig. 2 also shows the organization of a LAM. 

s The wide arrows 21 indicate the main data paths 
through the LAM, while the other arrows 29 indicate 
the flow of control information and signals. 

Referring back to Fig. 1, control module 8 may 
embody processors and storage required for Met- 
ro work Control and/or Network Management func- 
tions. Additionally, a control module 7 may contain 
interfaces to communication lines that are not 
"plug" compatible with LAMs. In this case the 
control module performs protocol conversion(s) to 

15 allow for connectivity to foreign networks, or end 
nodes utilizing physical, data link, or foreign net- 
work protocols. Each control module contains a 
switching "fabric" interface providing for exchange 
of information with any of the other LAMs or control 

20 modules within the same Network Node housing. 

We will refer to field x of packet p by the 
notation px For example, the type field of a packet 
named p is denoted by p.type. 

Fig. 3 shows the layout of some parts of the 

25 incoming and outgoing session tables that reside 
within each LAM of a high speed network node. 

The Incoming Session Table (1ST), shown in 
Ftg. 3.1 is indexed by the incoming session iden- 
tifier. There are entries for M incoming sessions 

30 which are numbered and whose identifiers are 
0,1,2 M-1. For each established incoming ses- 
sion the control logic of the LAM can rapidly look- 
up the corresponding outgoing session identifier (o- 
sid) and the outgoing LAM (O-LAM) to which data 

35 packets will be forwarded. Each entry of the Incom- 
ing Session Table also contains the number of 
packets queued for forwarding (g-size) and a repre- 
sentation of the queue of these packets (g-an- 
chors). The packets themselves are queued in the 

40 receive buffers area of the LAM. shown in Fig. 2. 
The Incoming Session Table also includes "other 
information" related to the windowing and inter- 
nodal pacing. 

We will refer to field x of the i'th entry of table 

45 T by the notation T.i.x. For example, the queue 
size field for session 3 in the Incoming Session 
Table 1ST is denoted by IST.3.g-size. 

The Outgoing Session Table (OST) shown in 
Fig. 3.2 is indexed by the outgoing session iden- 

50 tjffer. There are entries for N outgoing sessions 
which are numbered and whose identifiers are 

0,1,2 N-1. For each established outgoing session 

the control logic of the LAM can rapidly look-up the 
corresponding incoming session identifier (i-sid) 

55 and the incoming LAM (l-LAM) from which data 
packets will be arriving across the switching fabric. 
Each entry of the Outgoing Session Table also 
' contains a READY bit The Outgoing Session Table 
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also includes "other information" related to the 
windowing and inter-nodal pacing. 

Below, we disclose a new intra-node pacing 
scheme for hardware structures that have multiple 
processors and distributed buffers, such as the 
type described above. 

Consider the structure depicted in Fig. 2. Typi- 
cally, a data packet is received via a transmission 
line 15 and enqueued by a LAM 5. The incoming 
LAM (l-LAM) executes the link flow control and the 
intermediate routing function such as local identifier 
swapping. 

See A. Baratz, J. Gray, P. Green, J. Jaffe and 
0. Pozefsky, "SNA Networks of Small Systems, 9 
IEEE JSAC, vol. SAC-3, No. 3. May 1985. The 
packet will be transmitted by a different LAM, an 
outgoing LAM, (O-LAM), to the specified next net- 
work node. The transmission scheduling is per- 
formed at the O-LAM. For each selected session, 
the OLAM signals the corresponding l-LAM to 
forward a packet of the specified session. Only 
then can the l-LAM forward a data packet across 
the switching fabric. This signalling and packet 
delivery across the switching fabric is the termed 
intra-node pacing. 

The scheme of this invention maintains the 
READY bits by sending control messages between 
LAMs. We define two types of control messages, 
SR and YT, as follows: 

SR Session Ready: a control message sent by an 
l-LAM to an O-LAM, an SR indicates that one or 
more data packets are queued at the l-LAM for 
transmission on a particular session. An SR control 
message contains its (self-identifying) type and a 
session identifier. As we shall describe below, an 
SR may often be "piggybacked" along with a data 
packet transfer to reduce switching fabric overhead 
as shown in Fig. 4.4. An SR control message can 
also be sent in a separate packet; this message is 
then referred to as an isolated SR control message. 
See Fig. 4.1. 

YT Your Turn: a control message sent by an O- 
LAM to an l-LAM. A YT control message (see Rg. 
4.2) gives the "go ahead" to the l-LAM to pass the 
next queued data packet for a particular session 
across the switching fabric to the O-LAM. A YT 
control message contains its (self-identifying) type 
and a session identifier. 

A LAM performs scheduling only among its 
ready (READY YES) outgoing sessions. (With re- 
spect to these sessions it is the O-LAM) To sched- 
ule a packet for transmission on a selected ses- 
sion, the O-LAM sets the READY bit to NO and 
then sends a YT control message across the 
switching fabric to the corresponding l-LAM. Acting 
upon receipt of the YT control message, the l-LAM 
forwards the next queued data packet for the se- 
lected session across the switching fabric to the 0- 



LAM. If there are additional data packets already 
queued for the same session at the l-LAM, the I- 
LAM piggybacks an SR control message for the 
selected session. See Rg. 4.4. The piggybacked 

5 SR immediately follows (or is embedded within) the 
data packet Notice that the overhead of piggybac- 
king an SR with a data packet is negligible be- 
cause no extra switching fabric connection setup is 
required and the SR is very short, i.e., an SR 

w carries very few bits of information. If the session 
queue at the l-LAM becomes empty upon forward- 
ing the packet, no control message is piggybacked. 
See Rg. 4.3. Correspondingly, when the data pack- 
et is received by the O-LAM, READY is set to YES 

75 if an SR is piggybacked; otherwise, READY re- 
mains set to NO. 

In case an SR is not be piggybacked, then 
later, when the queue of the session at its Incoming 
LAM changes state from empty to non-empty, the 

20 l-LAM sends an "isolated" SR control message 
across the switching fabric to the corresponding 0- 
LAM, which will thereupon set READY to YES for 
the selected session. See Rg. 4.1. (This, of course, 
also describes the situation for the first data packet 

25 of the session &mdash. where READY must be 
initially set to NO in the O-LAM.) In this "worse" 
case, each data packet that traverses the switching 
fabric will be preceded by one exchange of SR and 
YT control messages &mdash. however, the SR 

30 and YT messages will typically contain very few 
bits compared to a data packet. Furthermore, this 
"worse" case only occurs when a session is 
"mostly Idlel" 

35 

Message Formats and Data Structures 

Rg. 4 shows the formats of some of the mes- 
sages that flow from LAM to LAM across the 

40 switching fabric of a high speed network node for 
data flow and intra-node pacing. Each message 
contains a type Held so that each message is self- 
describing. As discussed above there is an Isolated 
Session Ready control message (type l-SR). See 

45 Rg. 4.1. Data packets without Session Ready pig- 
gybacking are type D. See Rg. 4.3. We represent 
a data packet with Session Ready piggybacked as 
the separate type D-SR. See Rg. 4,4. An alter- 
native would be to have a separate field within (or. 

so trailing) a data packet to carry a Session Ready 
option bit 



Control Procedures 

The following describes those procedures ex- 
ecuted by the Intranodal pacing of data. Session 
setup and other supervisory procedures are also 
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executed by the control logic of the LAM. These 
set up and supervisory procedures are well known 
to those skilled in the art. 



Packet Arriving at LAM from Communication Link 
Interface ~ 

Whenever a data packet arrives from the com- 
munications link interface (see 23 of Fig.2) the w 
following procedure is executed: 

1. Let the Incoming Session Table be called 
1ST. Let the arriving packet be named P. Let 
I = P.session-id, the incoming session identifier 
number. Append P to the queue represented by 75 
IST.I.q-anchor. 

2. If the queue size IST.I.q-size changed from 0 
to 1 as a result of the previous step then build 
an Isolated SR control message called M, with 
M.type = l-SR and M. session-id = J where 20 
J = IST.I.o-sid, the outgoing session identifier of 

the incoming session I. Send message M across 
the switching fabric to the outgoing LAM iden- 
tified by iST.I.o-lam. 

3. Otherwise, if IST.I.q size did not change from 25 
0 to 1 , there is no need to send an Isolated SR 
message because the queue was non-empty 
before P arrived and we must have already 
informed the outgoing LAM of this fact 



packets for the same session queued 
"behind it". 

c. Otherwise, the queue has "gone empty", 
and the packet type is set as P.type = D. 

d. Set P.session-id = IST.I.o-sid. The session 
identity of the packet is changed in prepara- 
tion for its transmission on the outgoing com- 
munication link. 

e. Send packet P across the switching fabric 
to the outgoing LAM identified by IST.I.o-lam. 

4. T = D, a data packet without session ready 
piggybacked. 

Transfer packet P to the send buffers 24 pack- 
et shown in Fig. 2. The communications link 
interface (see 23 of Rg. 2) continually 
"services" the send buffers 24 by dequeuing 
available packets and transmitting them on the 
transmission line attached to the LAM. This 
packet does not affect the ready bit in the 
outgoing session table entry for the correspond- 
ing session. 

5. T = D-SR, a data packet with session ready 
piggybacked. 

Set the packet type to T = D. The session ready 
information shouldnt be passed to the commu- 
nications link interface. Transfer packet P to the 
send buffers. 

Set OST.I.READY = YES. Thus, the outgoing 
LAM for this session is now informed that there 
is at least one more packet queued at the in- 
coming LAM. 



Packet Arriving at LAM from Switch Interface 

Whenever a data packet arrives from the 
switch link interface the following procedure is ex- 
ecuted: 

1. Let the Incoming Session Table be called 
1ST. Let the Outgoing Session Table be called 
OST. Let the arriving packet be named P. Let 
I = P.session-id, be either the incoming or out- 
going session number of the packet, depending 
on its type. Let T = P.type. Execute one of the 
following steps depending on the packet type T. 

2. If T=I-SR. an isolated session ready mes- 
sage packet Set OST.I.READY = YES. The out- 
going LAM for this session is now informed that 
there is at least one packet queued at the in- 
coming LAM. 

3. If T = YT, a "your turn" control message pack- 
et. 

Perform the following (sub) steps. 

a. Dequeue a packet from the queue repre- 
sented by IST.I.q-anchor. Call It P. IST.I.q- 
size will be decremented by one as a side 
effect of this dequeuing. 

b. If IST.I.q-size is not zero, then set 
P.type = D-SR. This data packet will carry the 
"session ready" information that there are 



Communications Link Transmission Scheduling 

35 

The number of uncommitted packet buffers in 
the send buffers (See 24 of Rg. 2) is kept in a 
counter called NUX that is accessible to the logic 
control section (See 28 of Fig. 2) of the LAM. 

40 When the LAM is "initialized", NUX is set to the 
total number of packet buffers physically present in 
the send buffers area. Whenever a data packet 
departs from the LAM to the Communications Link 
interface, a signal is sent from the Communications 

45 Unk interface to the logic control section, where- 
upon the NUX counter is incremented by one. That 
is. whenever a packet leaves the LAM via the 
communication link interface the operation Set 
NUX = NUX +1 is executed. 

50 Whenever NUX > 0 and OST. J.READY = YES 
for some J, the logic control section of the LAM 
chooses one such J. The entry chosen can depend 
on the session priority or class of service and/or be 
arranged so as to give each "active session" fair 

66 access to the transmission facility of the LAM 
and/or be limited by the pacing considerations. 
Having chosen such J the control section executes 
the following procedure: 
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1. Set NUX= NUX-1. Count one packet send 
buffer as committed. 

2. Set 0ST.J.READY = NO. Reset the READY bit 
for outgoing session J. it will be flipped YES 
when another "session ready" indication arrives 
at this LAM. 

3. Build a "your turn" control message called M, 
with M.type = YTand M.session-id = OST.J,i-sid. 
Send message M across the switching fabric to 
the incoming LAM identified by OST.J.i-lam. 
This will cause the incoming LAM to Send the 
next data packet for the selected session across 
the switching fabric as described above. 



Claims 

1. A method of controlling the transmitting of pack- 
ets between line adaptor modules of a network 
node, comprising the steps of: 

(a) transmitting an Session Ready (SR) control 
message from one of said modules to another of 
said modules if said one module has at least 
one packet in its buffers to be transmitted to 
said other module; 

(b) setting a ready bit in said other module to a 
ready equals yes state if said SR control mes- 
sage is received by said other module, said 
ready equals yes state indicating that said one 
module has at least one packet in its buffers to 
transmit to said other module; 

(c) transmitting a Your Turn (YT) control mes- 
sage to said one module if said ready bit is set 
to said ready equals yes state in said other 
module; and • 

(d) transmitting a packet from said one module 
to said other module if said YT control message 
is received by said one module. 

2. Method as recited in claim 1, further comprising 
the step of setting the ready bit to a ready equals 
no state prior to transmitting said YT signal to said 
one module, in which ready equals no state, no 
further YT signal will be transmitted to said one 
module. 

3. Method as recited in claim 1 or 2, wherein said 
ready bit is stored in an outgoing session table of 
said other module, said table having N entries for 
N outgoing sessions with a ready bit being stored 
in each entry in either the ready equals yes or 
ready equals no state, with each entry of said table 
corresponding to one of said N outgoing sessions 
between a pair of said modules. 

4. Method as recited in claim 1, 2 or 3, wherein 
said SR control message is an isolated control 
message if said one module receives a packet to 
be transmitted to said other module after having no 
packets to be transmitted to said other module. 

5. Method as recited in claim 1 or in anyone of 



claims 2-4, wherein said SR control message is 
transmitted within a first data packet being trans- 
mitted from said one module when said one mod- 
ule still has more packets in its buffers to transmit 
5 to said other module after having released said first 
packet from one of said buffers. 

6. A method of transmitting packets between line 
adaptor modules of a network node, comprising the 
steps of. 

w (a) transmitting a Session Ready (SR) control 
message from one of said modules to another of 
said modules if said one module has at least 
one data packet in its buffers to be transmitted 
to said other module, said message being an 

75 isolated SR message if said one module re- 
ceives a data packet to be transmitted to said 
other module after having no data packets 
stored in its buffers to be transmitted to said 
other module, said message being transmitted 

20 as part of a first data packet if said other module 
still has at least one data packet, to be transmit- 
ted to said other module, in its buffers after said 
first data packet is released from said buffers; 
(b) setting a ready bit in said other module to a 

25 ready equals yes state if said SR control mes- 
sage is received by said other module, said 
ready equals yes state indicating that said one 
adaptor has at least one data packet in its 
buffers to transmit to said other module; 

30 (c) transmitting a Your Turn (YT) control mes- 
sage to said one module if said ready bit is set 
to said ready equals yes state in said other 
module; and 

(d) transmitting a data packet from said one 
35 module to said other module if said YT control 
message is received by said one module. 

7. A network node for controlling the transmission 
of packets between transmission lines in contact 
with said node comprising: 

40 a plurality of modules, each module in contact with 
at least one of said lines, each module comprising: 

(a) means for transmitting a Session Ready (SR) 
control message from one of said modules 
through said switching fabric to another of said 

45 modules if said one module has at least one 
packet in its buffers to be transmitted to said 
other module; 

(b) means for setting a ready bit in said other 
module to a ready equals yes state if said SR 

so control message is received by said other mod- 
ule, said ready equals yes state indicating that 
said one adaptor has at least one packet in its 
buffers to transmit to said other module; 

(c) means for transmitting a Your Turn (YT) 
55 control message through said switching fabric to 

said one module if said ready bit is set to said 
ready equals yes state in said other module; 
and 
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(d) means for transmitting a packet from said 
one module to said other module if said YT 
control message is received by said one mod- 
ule. 
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FIG.4.1 



FIG.4.2 



FIG.4.3 



ISOLATED SR CONTROL MESSAGE 



I = SR 



o-sid 



type session-Identifier 



YT CONTROL MESSAGE 



YT 



l-sld 



type session-identifier 



DATA PACKET WITHOUT SR PIGGYBACKED 



o-sid 



data 



type session-identifier session-data 



FIG.4.4 



DATA PACKET WITH" SR PIGGYBACKED 



D-SR 



o-sid 



data 



type session-identifier session-data 
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